The following datasets will be used in the course:


ahd

Real data on Alzheimer’s disease.


airsat

Real data on 10,000 customers of an airline.


attrition

Simulated data on 1,470 fictional employees who either quit their jobs (attrition = yes) or did not (attrition = no).


globalWarm

Real data on emotions, ideology, and party affiliation as predictors of attitudes towards government action on climate change.


hcp_memory

Real neuroimaging data from the Human Connectome Project used to predict scores on a memory test. Note: the data were artificially modified to increase predictive power and make activities more engaging.


heart

Real data on risk for heart disease.


iris

Real data on 150 iris flowers.


titanic

Real data on 963 passengers on the Titanic.


water

Real data on 3,276 different water bodies. Modified so that Potability is a character variable rather than a dummy-coded numeric variable.

Copyright © 2022 Jeffrey Girard and Shirley Wang.